This Page Is Inserted by IFW Operations 
and is not a part of the Official Record 



BEST AVAILABLE IMAGES 



Defective images within this document are accurate representations of 
the original documents submitted by the applicant 

Defects in the images may mda^ ^s^^yt^tsdto): 

• BLACK BORDERS 

• TEXT CUT OFF AT TOP, BOTTOM OR SIDES 

• FADED TEXT 

• ILLEGIBLE TEXT 

• SKEWED/SLANTED IMAGES 

• COLORED PHOTOS 

• BLACK OR VERY BLACK AND WHITE DARK PHOTOS 

• GRAY SCALE DOCUMENTS 



IMAGES ARE BEST AVAILABLE COPY, 

As rescanning documents will not correct images, 
please do not report the images to the 
Image Problem Mailbox. 



per 



WORLD INTELLECTUAL PROPERTY ORGANCZATION 
International Bureau 




INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERA TION TREATY (PCI) 

(li) International Publication Number: WO 93/20216 

(43) International Publication Date: 14 October 1993 (14. 10.93) 



(51) International Patent Classification 5 : 
C12N 15/82, 15/29, 15/62 
C12P 21/02, C12N5/10 
A01H5/00 



Al 



(21) International Application Number: PCT/CA93/00141 



(22) International Filing Date: 



2 April 1993 (02.04.93) 



(30) Priority data: 

0V862.355 2 April 1992 (02.04.92) 

PCT/CA92/00I6I 15 April 1992 (15.04.92) 
(34) Countries for which the regional 

or international application 

was filed: 



US 
WO 



CAetal 



(60) Parent Applications or Grants 
(63) Related by Continuation 
US 

Filed on 
US 

Filed on 



PCT/CA92/00161 (CIP) 
15 April 1992(15.04.92) 
07/862,355 (CIP) 
2 April 1992(02.04.92) 



(71) Applicant (for all designated States except US); UNIVERSI- 
™ TC t£^o°^ GIES INTERNATIONAL, INC. [CA/ 

Sn In\ \h% Uospltzl Drive RW - ^ M - 



(72) Inventor; and 
(75) Inventor/Applicant (for US only) ; MOLONEY Mauriro 

jf-fifgfjaj! Edgehni Sace - ^^wSse 

(74) Agenb. MORROW. Joy D. ct aL; Smart & Biggar, 900 - 
55 Metcalfe Street, P.O. Box 2999, Station D Ottawa. 
Ontario KIP 5Y6 (CA). ' utiawa > 

(81) Designated States: AT, AU, BB, BG, BR. CA CH C7 
DE, DK, ES, FI, GB, HU, JP. Kp', & 2?' £u 

sTuA N bs W ^ N0 ' ^ ^ ^SS. SD,sS 
Published 

international search report 
Before the expiration of the time limit for amending the 
claims and to be republished in the event of the receipt of 



(54) Title: OIL-BODY PROTEIN CIS-ELEMENTS AS REGULATORY SIGNALS 



ANY CONVENIENT 

Nco1 BspHl RESTRICTION SITE 

ATG | 



Pstl / — rfTM 

Smal OLEOSIN REGULATORY FOREIGN DMA TO rp ~" 

BcSm SEQUENCE EXr^STO OLEOSIN TERMINATOR 

Spot 
Xbaf 
Nof1 

BstXI pPAW4 

OQCI 



AMPICILLIN RESISTANCE 



(57) Abstract 



FOR THE PURPOSES OF INFORMATION ONLY 



appUcaS^fuSr^PCT^ ^ m " ^ PCT ° n ^ ° f P * mpMelS puWishin « 



AT A loir la 

AU Australia 

6B Barbados 

BE Belgium 

8F 8urfcina Faso 

BG Bulgaria 

BJ Benin 

BR Brazil 

CA Canada 

CF Central African Republic 

CC C ongo 

CH Switzerland 

CI Cote d*l voire 

CM Cameroon 

CS Oechuslovakiu 

CZ Oecli Kcpuhlic 

DG Germany 

OK Denmark 

ES Spain 

Ft Finland 



FR France 

CA Gabon 

CB United Kingdom 

GN Guinea 

GR Greece. 

HO Hungary 

IE Ireland 

IT Italy 

JP Japan 

KP Democratic People** Republic 

of Korea 

KR Republic or Korea 

KZ KazaLhstan 

1.1 Liechtenstein 

LK Sri tanLa 

I4J Ijuurnihourg 

MC Monaco 

MC Madagascar 

ML Mali 

MN Mongolia 



MR Mauritania 

MW Malawi 

NL Netherlands 

NO Norway 

NZ New. Zealand 

PL Poland 

PT Portugal 

RO Romania 

RU Russian Federation 

SO Sudan 

SE Sweden 

SK Slovak Republic 

SN Senegal 

SU Soviet Union 

TO Chad 

TC Togo 

UA Ukraine 

US United States of America 

VN Viet Nam 



WO 93/20216 PCT/CA93/00141 



10 



15 



20 



25 



30 



OIL-BODY PROTEIN CIS-ELEMENTS 
AS REGULATORY SIGNALS 

INTRQDT TPT fn NT 



Technical Field 

This invention relates to upstream DNA sequences and their use to control 
expression of genes in developing plant seeds and their use. 

Background 

Studies in plant gene expression have yielded a number of general 
conclusions concerning the elements that control expression. Plants, like other 
organisms both prokaryotic and eukaryotic, contain conserved or consensus 
sequences upstream (5') of the transcriptional start site of genes which appear 
capable of regulating transcriptional rates. In eukaryotes, these sequences include 
a motif found typically about 25 bp 5' to the transcriptional initiation site which 
has the sequence TATAA/TAA/T and is referred to as a TATA box. The role of 
this TATA box appears to be to define the transcriptional start for RNA 
polymerase II. A second upstream sequence is referred to as a CAAT box. 
Typically, this is found about.75 bases upstream of the transcriptional start and is 
associated with regulating the frequency of transcriptional initiation. In plants the 
consensus sequence may be either CCAAT or sometimes AGGA. However, 
neither of these alternative consensus sequences need be present in all plant genes. 
These sequence motifs and their DNA context within 70-90 bases upstream of the 
transcriptional start are often referred to as promoters. In general, 5' of the 
promoter region and most frequently within 2000 bases of it are cis-acting 
elements which confer a variety of properties on the promoter and which can 
modulate transcriptional activity in either a constitutive or a non-constitutive 
manner. These cis-acting sequences may be referred to as enhancers (if they are 
responsible for increases in transcription) or silencers (if they are responsible for 
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decreases or suppression of transcription). Enhance™ a nH *w« — 



30 



the sites at which nuclear proteins bind or interact. The modulating nuclear 
proteins are called trans-acting factors. They are considered to be very imporfcnt 
for non^r*titutive or regulated egression as they may be the major determinant 
of the activity of a gene in a particular tissue or organ or in response to an 
external stimulus. The relationship between this protein binding and the 
enhancer/silencer element may determine the transcriptional activity. The isolation 
of genes which are activated by heat, light or chemicals such as endogenous 
hormones or are activated in specific organs such as seeds, leaves or flowers has 
perrmttedanalysisof^ ^ 
numerous, but not all, cases, it has been shown that the construction of chimeric 
genes which contain the promoter and optionally cis-elements from a given 
regulated gene and a coding sequence of a reporter protein not nonnally associated 
w^ttet^motergr^ 

promoters from seed-specific genes for the expression of sequences in seed of 
genes that are either not normally expressed in a seed-specific manner or those 
that require an altered pattern of expression has been attempted on only a few 
occasions. In all cases to date, chimeric genes designed for seed-specific 
expression have used seed-storage protein regulatory signals and promoters 
However, it is evident from work on storage protein gene expression that 
expression commences at a fairly late stage in embryogenesis, namely once the 
embryo has reached Cm the case of dicots) the classical torpedo shape. Thus, 
although storage proteins express at high levels and their regulation is often ' 
transcriptional, the timing and level of expression may not be ideal for all seed- 
specific appfications. Itis, therefore, of interest to identify other seed-specific 
promoters and enhancers with temporal or cellular specificity different from that of 
seed storage proteins, such as those from oleosins. 



Relevant literature 

The following disclose organ or tissue-specific regulatory sequences 
used to produce tissue- or organ-specific expression in transformed plants There 
are several by now "classical- examples of regulated gene expression in non-seed 
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protein chloramphenicol acetyl transferase could be expressed in a light-regulated 
and organ-specific manner in transgenic plants if the coding sequence for the 
reporter protein was fused with the promoter and upstream sequences from a pea 
gene encoding ribulose bisphosphate carboxylase (Huhr, Science (1986), 
232:1106-1112). 

Sengupta-Gopalan et al. Proc. Natl. Acad. Sci. USA, (1985) 
82:3320-3324 reported expression of a major storage protein of french beans, 
called JJ-phaseoIin, in tobacco plants. The gene expressed correctly in the seeds 
and only at very low levels elsewhere in the plant. However, the constructs used 
by Sengupta-Gopalan were not chimeric. The entire B-phaseolin gene including 
the native 5'-flanking sequences were used. Subsequent experiments with other 
species (Radke et al. (1988) Tfaeor. App. Genet 75:685-694) or other genes 
(Perez-Grau, L., Goldberg, R.B., 1989, Plant Cell, 1:1095-1109) showed the 
fidelity of expression in a seed-specific manner in both Arabidopsis and Brassica. 
Radke et aL (1988), vide supra, used a "tagged" gene i.e., one containing the 
entire napin gene plus a non-translated "tag". 

In tissue and organ specific expression there have been several 
examples showing that sequences upstream of the transcriptional start may be used 
to confer tissue/organ specificity to a gene introduced into plants by genetic 
engineering. Examples include engineering seed-specific gene regulation (Radke 
et al. (1988) vide supra; Bustos et al. (1989), Plant Cell, 1:839-853). Li both 
examples, sequences upstream of the coding sequences of seed proteins were 
linked to a reporter tag (either as RNA or protein) and seed specificity was 
conferred on expression of the reporter. These were all storage protein genes 
rather than oleosins. Seed storage proteins have different temporal expression 
patterns from oleosins. 

The DNA motifs that might give rise to seed-specific expression are 
now the subject of many studies. Marcotte et al. (Marcotte, W.R., Russel, I.S., 
Quantrano, R.S., 1989, Plant Cell, 1:969-976) studied the Em gene of wheat ana 
proposed two motifs called "Em-boxes" which might be consensus sequences for 
seed-specific expression. Interestingly, one of these boxes called EM-2 is similar 
to that found in other storage protein genes from monocots (triticin-wheat) and 



W093/20216 PCT/CA93/0014I 

4 

even dicots (B^onglycinin-soybean). Hatzopoulos et aL (1990, Plant Cell, 2^457- 
467) investigated the sequences directing embryo-specific expression of a carrot 
lipid-body protein gene. A number of AT rich motifs were identified, being 
protected from digestion during DNAse treatment presumably by trans-acting 
5 proteins. The motifs identified, however, were not shown to be consensus motifs 
for other seed-specific genes. 

DeClercq et aL Plant PhysioL, (1990), 94:970-979 used the 
promoter of the Arabidopsis 2S albumin and combined coding sequences from both 
the Arabidopsis and Brazil nut 2S albumins. Fusions were made in regions 
10 showing low conservation. Transformation of both tobacco and Brassica napus 

yielded seed-specific expression and correct accumulation of the modified storage 
proteins. Levels of expression were between 0.05% and 0.3% of total cellular 
protein. 

Another example of this form of seed-specific expression of foreign 
15 sequences was the expression of leu-enkephalins in seeds. To obtain seed specific 

expression, a chimeric DNA sequence encoding a 2S albumin and a short 
oligonucleotide encoding leu-enkephalin (a pentapeptide) was included in the 
albumin coding sequences between the 6th and 7th cysteines of the native protein 
(Vanderkerhove et aL Biotechnology, (1989) 2:929-932). Again this gene 
20 expressed in a seed-specific manner allowing the accumulation of up to 50 nmol 

leu-enkaphalin per g of seed. 

Genomic clones encoding oil-body proteins with their associated 
upstream regions have been reported for two species, maize (Zea mays, Bowman- 
Vance and Huang, (1987) J. BioL Chem., 262:11275-11279; and Qu and Huang, 

25 (1990) J. BioL Chem., 26^:2238-2243) and carrot (Hatzopoulos et al. (1990) Plant 

Cell, 2:457-467). cDNAs and genomic clones have also been reported for one 
cultivated oilseed, Brassica napus (Murphy, et al. (1991), Biochem. Biophys. 
Acta, JflSS:86-94; and Lee and Huang (1991) Plant Physico 96:1395-1397.) 
Reports on the expression of these oil-body protein genes in developing seeds have 

30 varied. In the case of Zea mays, transcription of genes encoding oil-body protein 

isoforms began quite early in seed development and were easily detected 18 days 
after pollination. In non-endospermic seeds such as the dicotyledonous plant 



after pollination. In non-endospennic seeds such as the dicotyledonous plant 
Brassica napus (Canola), expression of oil-body protein genes seems to occur 
much later in seed development (Murphy, et al. (1989), Biochem. J., 25.8:285- 
293) than with corn. 



SUMMARY OF THE INVENTTOM 
Methods and compositions are described for the exploitation of an 
oil-body protein transcriptional regulatory sequence and optionally its 
accompanying 5' untranslated leader sequence for the expression of heterologous 
genes in a seed-specific manner. The method includes the steps of transforming < 
plant cell with a DNA construct comprising the regulatory sequence and a DNA 
sequence other than the open reading frame native to the regulatory sequence, 
generating a plant from the transformed cell and growing it under conditions 
whereby seed is produced and the DNA sequence is expressed under the 
transcriptional control of the regulatory region. These sequences will be valuable 
in applications where expression of a seed-borne product needs to be modified, 
enhanced or suppressed. They could also be used to produce modified seeds 
containing foreign proteins to increase the intrinsic value of the seed. 



BRIEF DESHRTPTTfW OF THE PR AWTWfi<! 

Fig. 1 shows a schematic diagram of vector pPAW4 enclosing an 
oleosin regulatory sequence, an initiation codon, foreign DNA to be expressed, an 
oleosin terminator sequence and an ampicillin resistance gene. 

Fig. 2A shows the DNA sequence of an Arabidopsis genomic clone 
encoding a 18KDa oil-body protein. The open reading frame is interrupted by a 
short intron (which is marked) and the two exons are translated and indicated in 
IUPAC single letter amino-acid codes. 

Fig. 2B shows the restriction fragment from an Agrobacterium 
EMBL3 genomic library which encloses the Arabidopsis 18KDa oil-body protein 
coding sequence. The approximate position of the coding region is highlighted. 

Fig. 3 shows the effects of 10/zM ABA on the developmental 
expression of oleosin mRNA using a Northern blot analysis of total RNA. A) 
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(70/tg per lane) using 50 ng«P dCTP labelled OB990 as a probe (spec, act 10 9 
dpm/^g DNA). Heart-(H): (13-day), torpedo-CD (17-day), and wtyledonary -(C) 
(21-25 day) stage microspote^erived embryos with (+) and without (-) treatment 
for 48 h with 10/iM ABA. The blot was exposed to Kodak XAR5 film at 70°C 
5 for 20 minutes. The apparent size difference of the mRNAs in the different lanes 

is due to interfering quantities of starch in the different mRNA preparations. AH 
the lanes were equally loaded as judged by OD260 measurements and EtBr- 
staining. B) A 4.5 hour exposure of Fig. 3-A Q Relative intensity of the mRNA 
accumulation as determined by scanning densitometry. 
10 Kg- 4 shows me tissue specificity of oleosin. 50 fig of poly (A)+ 

RNA of roots (R), callus (Ca), Cotyledons (Co), leaves (L), and 24-day post- 
anthesis zygotic embryos (E) was probed with 50 ng of 32 P dCTP labelled OB990 
(spec. act. 10 8 dpmfpg DNA). 

Kg. 5 shows the developmental sensitivity of oil body protein 
15 synthesis to applied ABA. An estimated 10,000 dpm were loaded per well for 

paired samples of controls (lanes A,C,E,G) and ABA-treated (lanes B,D,F,H). 
All samples were treated for 2d with ABA, then labeled for 4 h with 1.85 ' 
MBq/mL ^methionine. Lanes A and B, 10-d-old cultures, sieved on 62^m 
screens to obtain globular embryos. Lanes C and D, 13-d-old cultures sieved on 
20 125/tm screens to obtain heart stage embryos. Lanes E and F, 17-d-old cultures, 

sieved on 250/un screens to obtain torpedo to early cotyledonary embryos. Lanis 
G and H, 25-d-old cultures, sieved on 500 M m screens to obtain cotyledonary stage 
embryos. 
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BRIEF DESCRIPTION OF TTTR SPPrrprr RUROnu^rrc 
In accordance with the subject invention, DNA constructs are 
provided which allow for modulation of plant phenotype in seed, particularly 
during early phases of embryogenesis. The DNA constructs provide for regulation 
of transcription in seed, using 5' untranslated sequences from genes active from 
the late globular stage through to embryo maturity (cotyledonary stage). 
Downstream from and under transcriptional initiation regulation of an oil body 
protein gene initiation region will be a DNA sequence of interest which will 
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be prepared which allow for integration of the transcription cassette into the 
genome of a plant cell. Conveniently, a multiple cloning site downstream from 
the seed specific transcriptional initiation region may be included so that the 
integration construct may be employed for a variety of DNA sequences in an 

5 efficient manner. 

Of particular interest is a regulatory sequence from an oil body 
protein gene, preferably an oil body protein gene expressed in dicotyledonous oil 
seeds. It has been reported that oil-body proteins accumulate considerably later 
than either oils (triacylglycerides) or storage proteins. TUi later expression would 
10 limit the value of any promoters associated with these genes for seed-specific 

expression as they could not be used for modification of expression of genes 
during early phases of embryogenesis. Surprisingly, however, expression of these 
genes in dicotyledonous oilseeds was found to occur much earlier than had hitherto 
been believed. Thus, the promoters and upstream elements of these genes are 
valuable for a variety of uses involving the modification of metabolism during 
Phases of embryogenesis which precede the accumulation of storage proteins. 

Oil-body proteins have been identified in a wide range of 
taxonomicaUy diverse species (see, for example, Moreau et al. Plant Physiol 
(1980), ££1176-1180; Qu et al. Biochem. J., (1986) 225:57-65). These proteins 
are uniquely localized in oil-bodies and are not found in organelles of vegetative 
tissues. In Brassica napus (rapeseed) there are at least three polypeptides 
associated with the oil-bodies of developing seeds (Taylor et al. (1990), Planta, 
131:18-26). The numbers and sizes of oil-body associated proteins may vary from 
species to species, m com, for example, there are four immunologicaUy distinct 
25 polypeptides found in oil-bodies (Bowman-Vance and Huang, 1988, J. Biol 

Chem., 261:1476-1481). Oleosins have been shown to comprise regions of 
alternate hydrophilicity, hydrophobic^ and hydrophilicity (Bowman-Vance and 
Huang, 1987, J. Biol. Chem., 262:11275-11279). The amino acid sequences of 
oleosins from corn, rapeseed and carrot have been obtained. See Qu and Huang, 
1990, J. Biol. Chem., 265:2238-2243, Hatzopoulos et al. 1990, Plant Cell, 2:457- 
467, respectively. In an oilseed such as rapeseed, oleosin may comprise between 
8% (Taylor et al. 1990, Planta, 181:18-26) and 20% (Murphy et al. 1989, 
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Biochem. J., 251:285-293) of total seed protein. Such a level is comparable to 
that found for many seed storage proteins. 

Of particular interest is a transcriptional initiation region associated 
with early embryogenesis, particularly the period preceding expression of storage 
5 proteins, so that in the early development of seed, it provides the desired level of 
transcription of the DNA sequence of interest Normal plant embryogenesis 
typically goes through a series of defined phases. For dicotyledonous seeds, 
embryogenesis includes the following phases: globular stage, heart stage, torpedo 
stage, and cotyledonary stage. For the purposes of this application, the definition 

10 of these terms is provided by Ray, Steves, and Fultz in Botany, (Saunders College 

Publishing), Chapter 17, page 294. Normally, the transcriptional initiation region 
will be obtainable from a gene which is expressed in the early formation of seed. 
Desirably the transcriptional initiation region maintains its activity from the late 
globular through, cotyledonary stage, more desirably continues active from the 

15 globular stage through the heart, torpedo and cotyledonary stages of 

embryogenesis. By obtainable is intended a transcriptional initiation region having 
a nucleotide sequence sufficiently similar to that of a natural oil body protein gene 
transcriptional initiation region sequence to provide for transcription in the early 
formation of seed. The sequence may be naturally occurring, synthetic or partially 

20 synthetic. 

The transcriptional initiation region from the oil body protein 
generally will be provided in a cassette which will include in the 5 '-3' direction of 
transcription, a transcriptional initiation region, a DNA sequence of interest and a 
transcriptional termination region, wherein the transcriptional regulatory regions 

25 are operably joined and functional in plant cells. One or more introns may also be 

present. After each manipulation, a DNA to be used in the final construct may be 
restricted and operably joined to other DNA to be used in the final construct, 
where each of the partial constructs may be cloned in the same or different 
plasmids. In a preferred embodiment, a coding sequence with a compatible 

30 restriction site may be ligated at the position corresponding to codon #1 of the oil- 

body protein gene. A schematic diagram of this substitution is shown in figure 1. 
The recombinant coding sequence may be inserted in such a way that it completely 



10 



15 



20 



25 



30 



WO 53/20216 

PCT/CA93/00141 

9 

replaces the coding sequence of the oil-body protein gene and is thus flanked at its 
3* end by the oil-body protein gene terminator and polyadenylation signal. 
Alternatively, polymerase chain reaction amplification may be carried out to 
produce DNA fragments containing the transcriptional initiation region 
conveniently flanked by restriction sites. The amplified fragments can be joined to 
the coding sequence for a polypeptide of interest, in a transcriptional or 
translations fusion, for example, to produce a chimeric gene in which the coding 
sequence of the polypeptide of interest is transcribed under the control of the 
transcription initiation region on the PCR amplified fragment. 

The transcriptional initiation region may be native to or homologous 
to the host cell, or foreign or heterologous to the host cell. By foreign is intended 
that the transcriptional initiation region is not found in the wild-type host into 
which the construct comprising the transcriptional initiation region is inserted. 
Generally, the regulatory sequence comprises DNA of up to 1.5 Kb 5' of the 
translational start of an oil^ody protein gene. This sequence may be modified at 
the position corresponding to the first codon of the desired protein by site-directed 
mutagenesis (Kunkel TA, 1985, Proc. Nad. Acad. Sci. USA, 32:488-492) or by 
introduction of a convenient linker oligonucleotide by ligation if a suitable 
restriction site is found near the N-terminal codon. 

In some cases it will be desirable to express the DNA sequence of 
interest as a fusion protein, particularly as a fusion protein with the oil body 
protein. The DNA sequence of interest can be inserted by routine techniques into 
the oil body protein coding sequence, in frame with the oil body protein coding 
sequence, such that transcription of the chimeric gene will produce a fusion 
protein. The fusion protein will preferably contain the coding region for amino 
acids number 44 through 122 in the Arabidopsis oil body protein as shown in 
Figure 2A, or the equivalent region from an oil body protein of a species other 
than Arabidopsis. to provide for transport to the oil body in cases where this is 
desirable. 

In order to isolate oil body protein coding sequences from other 
species, at least two approaches may be used. The first is to use the Arabidopsis 
clone described in the Examples as a probe in genomic libraries of other plant 
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species. This clone will hybridize well with oleosin clones frnm *wi„ 
species, in particular, essentially all cruciferous plants. For species which are 
evomtionarily divergent from Arabidopsis, for example, solanaceae, leguminaceae 
and an monocotyledons, an alternative method involves the use of an antibody 
5 raised against me gene product of an oleosin clone such as the Arabidopsis clone. 

This antibody may be used to screen a seed-derived cDNA expression library, for 
example using lambda gtll; Huynh et al. (1985) in cDNA Cloning, VoL 1, A 
Practical Approach, Ed. Graver IRL Press, pp. 49-78. This approach yields a 
cDNA clone of the oleosin for the new species which may then be used to isolate 

10 the genomic clone from a genomic library of that species by standard DNA 

hybridization techniques. 

The DNA sequence of interest may be any open reading frame 
encoding a peptide of interest, for example, an enzyme, or a sequence 
complementary to a genomic sequence, where the genomic sequence may be at 

15 least one of an open reading frame, an intron, a non-coding leader sequence, or 

any other sequence where the complementary sequence will inhibit transcription, 
messenger RNA processing, for example splicing or translation. The DNA 
sequence of interest may be synthetic, naturally derived or a combination thereof. 
Depending upon the nature of the DNA sequence of interest, it may be desirable 

20 to synthesize the sequence with plant preferred codons. The plant preferred 

codons may be determined from the codons of highest frequency in the proteins 
expressed in the largest amount in the particular plant species of interest 

The DNA sequence of interest may encode any of a variety of 
recombinant proteins. Examples of recombinant proteins which might be 

25 expressed by this procedure include anticoagulants, such as Hirudin, lymphokines 

such as those of the interleukin family, peptide hormones such as gonadotrophs 
releasing hormone, immunological reagents such as multi or single-chain 
antibodies and a variety of industrial valuable enzymes such as proteases, Upases 
and polyglucan hydrolases. 

30 "H* tennination region which is employed will be primarily one of 

convenience, since the termination regions appear to be relatively interchangeable. 
The termination region may be native with the DNA sequence of interest, or may 
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be derived from another source. Convenient termination regions are available and 
include the 3' end of the oil body protein gene terniinator and polyadenylation 
signal from the same gene from which the 5' regulatory region is obtained. 
Alternatively, a different terminator and polyadenylation signal may be employed 
5 with similar results, for example, the terminator of the nopaline synthase gene of 

Agrobacterium. 

The expression cassette may additionally contain a means for 
identifying transformed cells and/or selecting for transformed cells. For example 
the recombinant gene may be linked with a constitutively expressed selectable 
10 marker such as a gene for antibiotic resistance or herbicide resistance or a 

screenable marker, such as a gene conferring bioluminescence or colored 
properties to transformed cells. 

The DNA sequence of interest flanked at its 5' end by the oil-body 
protein promoter and regulatory sequences and at its 3' end by a terminator may 
15 be introduced into a suitable transformation vector including Agrobacterium Ti or 

binary plasmids, or a simple cloning plasmid (e.g., pTJC19, pBR322) for use in 
direct DNA uptake to plant cells via microinjection, electroporation, PEG- 
mediated uptake or a biolistic method. These methods are well known to those 
skilled in the art of plant transformation. See, for example, Horsch et al. (1985), 

20 Science, 227:1229-1231; Newhaus and Spangenberg (1990). Physiol. Plant, 

79:213-217; and Sandford et al. (1990), Physiol. Plant, 79:206-209. 

Transformed plants may be obtained from the transformed cells 
using standard regeneration protocols (see for example: Moloney et al. (1989), 
Plant Cell Rep., 8_:238-242) compatible with the transformation method. 

25 The expression cassette, constructed as described above, expresses 

essentially preferentially in developing seeds. The plant cells which have been 
transformed with an appropriate fusion peptide therefore are grown into plants in 
accordance with conventional ways and allowed to set seed. See, for example, 
McCormick et al., Plant Cell Rep. (1986) 5:81-84.. Two or more generations may 

30 be grown and either pollinated with the same transformed strain or different 

strains, identifying the resulting hybrid having the desired phenotypic 
characteristic, to ensure that the subject phenotypic characteristic is stably 
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maintained and inherited and then seeds harvested for isnlarinn nf ^m. ~t 

^/vpUUV VfX 

interest or for use to provide seeds with the new phenotypic property. The 
regenerated plants are then cultivated identically to non-recombinant plants in 
growth chambers, greenhouses or in the field and will show seed-specific 
5 expression of the recombinant gene at the mRNA level and often at the level of 

polypeptide or protein. 

It is possible mat the polypeptide/protein will itself be valuable and 
could be extracted and, if desired, further purified. Alternatively the 
polypeptide/protein or even the mRNA itself may be used to confer a new 
10 biochemical phenotype upon the developing seed. New phenotypes could include 

such modifications as altered seed-protein or seed oil composition, enhanced 
production of pre-existing desirable products or properties and the reduction or 
even suppression of an undesirable gene product using antisense, ribozyme or 
co-suppression technologies (Leant and Weintraub (1984), Cell 36: 1007-1015, 
15 antisense; Bazelhoff and Gerlach (1988), Nature 334:585-591, ribozyme; Napoli, 

et aL (1990), Plant Cell, 2:279-289, co-suppression). 

If the transformation has been performed to produce a new seed 
protein or peptide which requires extraction, this can be done using aqueous 
extraction with or without low concentrations of detergents, such as non- 
20 denaturing amounts of sodium dodecyl sulphate (SDS), Triton-X-100, Tween 20, 
MEGA-8 or any other detergent known not to irreversibly inactivate the desired 
protein. To extract the protein or polypeptide, dry seeds are ground by hand or in 
a mechanical grinder to produce an aqueous slurry or suspension. This can be 
resolved into three phases (particulate, aqueous soluble, and hydrophobic) by 
25 centrifugation, such as at 50,000 x g. Depending upon the nature of the product, 
it may be further purified in each of these phases and after solublization, may be 
selectively precipitated by the use of ammonium sulfate or purified using column 
chromatography, for example, using ion exchange, gel filtrates or affinity 
matrices. 

30 Wttfe me idea l host for the regulatory sequence reported here would 

be a cruciferous plant, it is possible to use these promoters in a wide variety of 
plant species given the relatively high conservation oleosin of genes. The major 
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barrier to the use of these promoters is between monocotyledonous and " 
dicotyledonous species. For transformations involving this specific expression on 
a monocot, a monocot olesin regulatory sequence should be used. For dicot seed- 
specific expression, a dicot oleosin regulatory sequence should be employed. The 
reported sequence can be used in a wide variety of dicotyledonous plants, 
including all members of the Brassica genus and crucifers in general. So'lanaceous 
plants, such as tobacco and tomato, also recognize the sequences and show correct 
regulation of expression in developing seeds. 

It is expected that the desired proteins would be expressed in all 
embryonic tissue, although different cellular expression can be detected in different 
tissues of the embryonic axis and cotyledons. This invention has a variety of uses 
which include improving the intrinsic value of plant seeds by their accumulation of 
altered polypeptides or novel recombinant peptides or by the incorporation or 
eumination of a metabolic step, m its simplest embodiment, use of this invention 
may result in improved protein quality (for example, increased concentrations of 
essential or rare amino acids), improved liquid quality by a modification of fatty 
acid composition, or improved or elevated carbohydrate composition. Examples 
include the expression of sulfur-rich proteins, such as those found in lupins or 
brazil nuts in a seed deficient in sulphurous amino acid residues. Alternatively a 
fatty acyl coenzyme A (COA) a transferase enzyme capable of modifying fatty ' 
acid ratios in triglycerides (storage lipid) could be expressed. In cases where a 
recombinant protein is allowed to accumulate in the seed, the protein could also be 
a peptide which has pharmaceutical, industrial or nutritional value. In this case 
the peptide could be extracted from the seed and used in crude or purified form' 
as appropriate for the intended use. The protein could be one truly foreign to tne 
Plant kingdom, such as an animal hormone, enzyme, lymphokine, anticoagulant 
or the like could be expressed in seed. The heterologous protein could then be ' 
extracted from the seeds and used for experimental, nutritional or pharmaceutical 
purposes after partial or complete purification. 

The following examples are offered by way of illustration and not 

by limitation. 
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EXAMPLES 

Example. 1 

The oil body protein gene from Arabidopsis was isolated on a 15kb 
insert present in a clone from an Arabidopsis thaliana v. Columbia genomic 
5 Ubrary in phage XEMBL3A by hybridization to a B. na/>ur oleosin clone. A 

1.8kb fragment containing approximately 868 base pairs 5* of the oleosin protein 
translation^ start was subcloned into a plasmid vector. The Arabidopsis 18 KDa 
oleosin gene is conveniently cloned as a 1803 bp fragment flanked by Ncol and 
Kpnl sites in a vector called pPAW4 (see Figure 1). Li order to convert the 

10 fragment into an expression cassette for general use with a variety of 

foreign/alternative genes, two modifications must be made. Firstly, using the 
technique of site-directed mutagenesis (Kunkel, supra) mutations at positions -2, -1 
and +4 are introduced using a mis-matched oligonucleotide. The mutations 
required are A to T (-2), A to C (-1) and G to A (+4). These mutations have the 

15 . effect of creating a BspHl site at positions -2 to +4. The BspHl site (T/CATGA) 
encloses the ATG initiation codon and gives a recessed end compatible with an 
Ncol cut. A second modification involves digestion with EcoRV and Mscl which 
releases a 658 bp fragment containing most of the coding sequence of the native 
oleosin. This leaves blunt ends at the cut sites which on separation of the vector 

20 and an ancillary sequence from the EcoRV-Mscl fragment, permits 

recircularization of the vector-promoter-terminator combination. This 
lecircularization is performed in the presence of an oligonucleotide linker 
containing restriction sites not found in the original 1803 Kb fragment 
On reckcularization, a plasmid containing all the upstream 

25 sequences of the oleosin gene, a transcriptional start site and an initiation codon 

embedded in a BspHl site is obtained. Thirty-one bases downstream of this is a 
short polylinker containing one or more unique restriction sites. To introduce any 
DNA sequence into this cassette the foreign sequence should have, or should be 
modified to contain, a BspHl or Ncol site at the initial ATG position. For 

30 sequences to be expressed as proteins this will assure conservation of the distance 

between the "cap" site and the initiator codon. 
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The DNA sequence to be inserted should terminate with a cohesive 
end of a restriction site not found on the plasmid. The polylinker interposed into 
the expression cassette may be chosen with this site in mind. Digesting the 
plasmid with BspHl and the appropriate restriction enzyme for the 3' end of the 
foreign sequence will ensure that a directional cloning of the desired DNA 
fragment may be effected. Using appropriate ligation conditions, the plasmid 
expression cassette with BspHl and a site compatible with the desired DNA 
fragment are incubated together to produce a Iigated product as shown in Figure 1. 

The complete construct from Ncol-Kpnl is now excised and 
introduced into an appropriate plant transformation vector such as an 
Agrobacterium plasmid. In order to introduce the construct into common 
Agrobacterium plasmids such as Bin 19 (Bevan, Nucl. Acid Research (1984) 
12:8711-8721) it may be necessary to use one of the additional restriction sites in 
plasmid pPAW4. In one scenario the plasmid could be cut with Smal and Kpnl. 
The resulting purified fragment then is Iigated to a Kpnl oligonucleotide linker 
and digested withy Kpnl. This provides a non-<iirectional Kpnl fragment for 
introduction into Bin 19. Alternatively, the construct may be excised with Kpnl 
and BamHl and Iigated directionally into pBIN 19 previously cut with the same 
restriction enzymes. The resulting Agrobacterium binary plasmid is mobilized into 
a disarmed Agrobacterium strain by tripartite mating (Ditta, et al. (1980), PNAS 
77: 7347-7351) or DNA transformation of competent Agrobacterium (An, (1988), 
Plant Mol. Biology Manual, A3 1-19, Kluwer Academic, Dordrecht, Netherlands). 

The Agrobacterium harboring the recombinant Bin 19 is used to 
transform any susceptible plant, e.g., Brassica sp. by standard explant co- 
cultivation (Horsch et al. (1985), supra). The transformed cells are selected in 
culture with kanamycin using the co-transferred antibiotic resistance genes 
(neomycin phosphotransferase) also contained between the T-DNA borders of pBin 
19. These transformed cells are induced to regenerate whole plants by standard 
procedures (e.g. for an oilseed such as rapeseed. See, Moloney et al. Plant Cell 
Rep., (1989), 8: 238-242). The regenerated plants are permitted to flower and are 
self-fertilized (or may be cross-fertilized). In cases where the foreign DNA in the 
construct encodes a translatable product, this product may be isolated from 
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aqueous extractions of the mature seed and subsequent fractionation of the slurry 
by centrifugation (30 min at 100,000 xg). Depending on the desired product it' 
may partition with any one of the three phases obtained. It may be localized in 
the pellet, aqueous soluble phase or in the lipid film on the surface of the 
centrifuged sample. 

Alternatively, it may not be necessary to extract the product as the 
purpose of the expression may be to divert metabolism in the seed thus changing 
the phenotype of the seed (e.g. by altering size or colour of the seed, changing the 
ratio of ratty acid residues in the seed or interdicting a particular metabolic step 
considered to render the seed less useful or valuable. Such metabolic steps might 
include the production of antinutritional secondary products which reduce the value 
or desirability of the seed when present In such cases, the seed, per se, is simply 
harvested and used in accordance with usual procedures. 

15 Example 2 

A number of constructs containing varying amounts of the DNA 
sequence from the 5' transcriptional initiation region of the Arabidopsis oleosm 
gene joined operably to the coding region for /^-glucuronidase (GUS) were 
prepared using PCR. The constructs are designated according to the amount of 
the oleosin 5' region contained, for example, the 2500 construct has approximately 
2500 base pairs of the oleosin 5' region. The constructs were introduced into 
Brassica napus and tobacco and the expression of the /3-glucuronidase gene was 
measured as described in detail below. The GUS expression results of five 
constructs, the 2500, the 1200, the 800, the 600 and the 200 constructs in 
transformed Brassica napus plants are shown in Table I. A negative control 
(untransformed plant) is also shown. The GUS expression results of two 
constructs, the 2500 and the 800 constructs, in transformed tobacco plants are 
shown in Table IL Table in shows the developmental timing of the expression of 
the oleosin promoter in transgenic embryos. 

The constructs were made using standard molecular biology 
techniques, including restriction enzyme digestion, ligation and polymerase chain 
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reaction (PCR). As an illustration of the techniques employed, the construction of 
the 800 construct is described in detail. 

In order to obtain a DNA fragment containing approximately 800 
base pairs from the 5' transcriptional initiation region of the Arabidopsis oleosin 
gene in a configuration suitable for ligation to a GUS coding sequence, a PCR 
based approach was used. This involves the use of the polymerase chain reaction 
to amplify the precise sequence desired for the expression analysis. To perform 
the necessary PCR amplification, two oligonucleotide primers were synthesized 
(MilUgen-Biosearch, Cyclone DNA synthesizer) having the following sequences: 

Pst 1 oleosin seq 
5' primer: 5 ' CACTQCAGGAA CTCTCTGGTAA 3' 
(GVR10) 

The italicized bases correspond to nucleotide positions -833 to -817 
in the sequence reported in Fig. 2A. The additional nucleotides 5' of this 
sequence in the primer are not identical to the oleosin gene, but were included in 
order to place a Pstl site at the 5' end of the amplification product. The Pstl site 
is underlined. 
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30 



sequence: 



A second (3') primer was synthesized which had the following 

3' primer (ALP 1) 

BamHl oleosin seq 

5-CrAC££GQQATCCTGTrrACTAGAGAGAATG-3 
Smal 



This primer contains the precise complement (shown in italics) to 
the sequence reported in Fig. 2A from base -13 to -30. In addition, it contains a 
further 13 bases at the 5' end. This sequence is not complementary to the oleosin 
gene, but was added to provide two (overlapping) restriction sites, Smal and 
BamHl, at the 3' end of the amplification product to facilitate cloning of the PCR 
fragment. 
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These two primers were used in a PCR amnlificatinn rrarrinn tn 

— x — — 

produce DNA fragment containing the sequence between nucleotides -833 and -13 
of the oleosin gene with a Pstl site at the 5' end and Srnal and BamHl sites at the 
3' end. PCR amplification was performed using the enzyme Taq polymerase 
5 (Perldn-Hmer-Cetus) using the conditions recommended by the enzyme 

manufacturer and a temperature program of 92°C (denaturation) 1 min, 55°C 
(annealing) 1 min and 72°C (elongation) 1 min. The template was the oleosin 
genomic clone shown in Figure 2B, top panel, which in the original X library 
isolate contained approximately 15 ldlobases of Arabidopsis DNA. 

10 The amplification product (OLEO p800) was gel purified on 0.7% 

agarose, recovered using the glass bead method of Vogelstein and Gillespie 
(Preparative and analytical purification of DNA from agarose. Proc. Natl. Acad. 
Sci. USA 1979 76:615-619) and digested with Pstl. The digestion product was 
gel purified and end filled using DNA polymerase Klenow fragment then cut with 

15 Smal to produce a blunt ended fragment This was cloned into the Srnal site of 

pUC19 to yield the plasmid pUC OLEOp800. Using the asymmetric positioning 
of the Accl site in the insert (at the position corresponding to -649 in the oleosin 
gene as shown in Figure 2B) it was possible to select both orientations of insertion 
into pUC vector. The clone having the insert oriented such that the 5' most end 

20 of the amplified fragment (in the direction of transcription) is proximal to the 

unique Hind IH site in the puC19 cloning vector and the 3' most end of the 
amplified fragment is proximal to the unique Eco RI site in the pUC19 closing 
vector. 

The resulting plasmid was then cut with BamHl to yield the 
25 fragment OLEOp800 flanked by BamHl sites. This fragment, BamHl-OLEO800, 

was cloned into the BamHl sites of a BamHl digested plasmid designated 
HspGUS1559. HspGUS1559 is a plasmid used as a binary vector in 
Agrobacterium, derived from the vector pCGN 1559 (MacBride and Summerfeldt, 
1990, Plant Molecular Biology, 14, 269-276) with an insert containing heat shock 
30 promoter (flanked by BamHl sites), the ^-glucuronidase open reading frame and a 

nopaline synthase terminator (derived from pB1221, Jefferson RA in Cloning 
Vectors 1988, Eds. Pouwels P., Enger-Valk BE, Brammer WJ., Elsevier Science 
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Pub BV, Amsterdam section VH, Aill). BamHl digestion of HspGUS1559~ 
results in the release of the heat shock promoter and permits the insertion of any 
other BamHl fragment in its place. The BamHl-OLEOp800 fragment was ligated 
into this site to yield the Agrobacterium pOLEOp800GUS1559. This plasmid was 
5 used to transform E. coli and the amplified plasmid was introduced into 

Agrobacterium (strain EHA101) by electroporation as described above (Rogers et 
aL, 1988, Plant Molecular Biology Manual, A2: 1-12, Eds. Gelvin S. and 
Schilperoort, R. Kluwer Academic, Dordrecht, Netherlands). 

The resultant Agrobacterium strain (EHA 101 x 

10 pOLEOp800GUS 1559) was used to transform Brassica napus plants by the method 

of Moloney et al. (Moloney, M.M., Walker, J.M., Sharma, K.K. (1989) Plant 
Cell Reports 8:238-242) or tobacco plants by the method of Horsch et al.(Horsch 
et aL Science (1985) 227:1299-1302). The resultant transgenic plants were 
allowed to set seed, and GUS expression assays (Jefferson R.A. (1987), Plant 

15 Mol. Biol. Rep. 5 387-405) were performed on the developing seeds and also on 

non-reproductive plant parts as controls. GUS expression reported is an average 
obtained from approximately five seeds from each of approximately five different 
transgenic plants. 

The other constructs were prepared by the same PCR method 
20 described above using the appropriate primers for amplifying the -2500 fragment, 

the -1200 fragment, the -600 fragment or the -200 fragment. The results in 
Brassica napus expressed as specific activity of GUS enzyme are shown in Table 
I. The results in tobacco are shown in Table n. 

These results demonstrate that the oleosin fragment from -833 to - 
25 813 used in the 800 construct contains sufficient information to direct seed-specific 

expression of a reporter gene in transgenic Brassica napus embryos as early as 
heart stage and that the Arabidopsis oleosin promoter is capable of directing 
transcription in plants other than Arabidopsis. These experiments also show that 
the sequences present in this promoter construct contain the cis elements required 
30 for an increase in transcription in response to the addition of abscisic acid, a 

characteristic of the native oleosin promoter. 
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It should be noted that the seed-specific expression demonstrated 
here does not depend on interactions with the native terminator of an oleosin gene 
3' end. In this example, the 3' oleosin terminator was replaced by a tenninator 
derived from the nopaline synthase gene of Agrobacterium. Hius, the sequence in 
► the 800 construct is sufficient to drive the desired expression profile independent 

of ancillary sequences. 
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GUS Activity On pmol product/min/mg protein) 
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Seed Root Leaf Stem Seed (Late- 
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1,506 
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260 
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300 
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30 
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'ABA is treatment for 24 hours with l(r 5 M abscisic acid prior to GUS activity 



measurement 
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Table R 

Seed Spe cific Exp res sion in Tnhatv -n 



Promoter/GUS GUS Activity Cm pmol product/min/mg protein) 
Constructs Mature Seeds 

2500 11,330 

800 10,970 
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Table HE 

Developmental Expression in tomd™ n»p»c 

GUS ACnVITY (in pmol product/min/mg protein)' 
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Construct 


Heart 
Stage 


Torpedo 
Stage 


Cotyledonary 
Stage 


Co tyledonary 
Stage 


Cotyledonary 
Stage 


20 


2500 


272 


1207 


2541 


1819 


11,607 




1200 


124 


262 


388 


5094 


8,980 




800 


149 


260 


962 


2617 


7,128 




600 


59 


41 


29 


38 


1,365 




200 


30 


25 


15 
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11 


25 
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11 


14 
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All publications and patent applications mentioned in this specification 
are herein incorporated by reference to the same extent as if each individual 
publication or patent application was specifically and individuaUy indicated to be 
incorporated by reference. 

Hie invention now being fully described, it will be apparent to one of 
ordinary skill in the art that many changes and modifications can be made thereto 
without departing from the spirit or scope of the appended claims. 
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1. A method of expressing a DNA sequence of interest in a seed cell, said 

method compiising: 

growing a plant capable of developing seed, wherein said plant comprises 
cells containing integrated into their genome an expression cassette comprising in the 
5'-3Mtoctai of transcription as operably linked components, a transcriptional 
regulatory region obtainable from an oil body protein gene, a DNA sequence of 
interest heterologous to said regulatory region, and a transcriptional termination 
region, under conditions whereby seed is produced in which said DNA sequence is 
expressed under transcriptional control of said regulatory region. 

2. The method according to Claim 1, wherein said gene is expressed during 
a phase of embryogenesis which precedes accumulation of storage proteins. 
15 3 - The method according to Claim 2 wherein said phase is from the 

formation of a globular embryo through to early cotyledonary stage. 

4. The method according to Claim 1, wherein said gene is from a 
dicotyledenous plant 

5. The method according to Claim 2, wherein said phase is selected from 
the group consisting of globular, heart, torpedo and cotyledonary. 

6. The method according to Claim 4, wherein said dicotyledonous plant is 
Brassica napus or Arabidopsis. 

7. The method according to Claim 1, wherein said plant is other than 
Arabidopsis. 

25 8 - A DNA construct comprising: 

a chimeric gene comprising a transcriptional regulatory region obtainable 
from a gene which encodes an oil body protein fused to a DNA sequence of interest 
heterologous to said regulatory region. 

9. The DNA construct according to Claim 8, wherein said gene is 
30 obtainable from a member of Brassicaceae. 

10. The DNA construct according to Claim 8, wherein said transcriptional 
regulatory region is obtainable from Brassica napus or Arabidopsis. 
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11. An expression cassette comprising: 

as operably linked components, a transcriptional regulatory region obtainable 
from an oil body protein gene, a DNA sequence of interest heterologous to said 
regulatory region, and a transcriptional termination region. 

5 12. An isolated transcriptional regulatory region obtainable from an oil 

body protein gene. 

13. The isolated transcriptional regulatory region of Claim 12 wherein said 
gene is obtainable from Brassica napus or Arabidopsis. 

14. A plant comprising: 

10 cells containing integrated into their genome a chimeric gene comprising a 

transcriptional regulatory region obtainable from an oil body protein gene, wherein 
said transcriptional regulatory region is fused to a DNA sequence of interest 
heterologous to said regulatory region. 

15. Seed comprising: 

15 cells containing integrated into their genome, a chimeric gene comprising 

a transcriptional regulatory region obtainable from an oil body protein gene, wherein 
said transcriptional regulatory region is fused to a DNA sequence of interest 
heterologous to said regulatory region. 

16. Seed according to Claim 15, wherein said transcriptional regulatory 
20 region is from an oil-body protein gene obtainable from Brassica napus or 

Arabidopsis. 

17. Seed according to Claim 15, wherein said seed is dicotyledonous. 

18. Seed according to Claim 16, wherein said seed is an oilseed. 

19. A method for altering seed-specific metabolism, said method 

25 comprising: 

growing a plant capable of developing seed, wherein said plant comprises 
cells containing integrated into their genome an expression cassette comprising in the 
5'-3' direction of transcription, a transcriptional initiation region from an oil body 
protein gene, a DNA sequence of interest other than a sequence native to said 
30 initiation region, and a transcriptional termination region, under conditions whereby 

seed is produced in which said DNA sequence is expressed under transcriptional 
control of said transcriptional initiation region. 
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20. The method according to Claim 19. wherein sairf pUp™* ic ,«^„„;„„ 

aw 1.WIUWA11^ 

or suppressing expression of endogenous genes expressed in plant seeds. 

21. The method according to Claim 19, wherein said transcriptional 
initiation region includes a silencer element 

5 22. The method according to Claim 19, wherein a transcribed strand of said 

DNA sequence is complementary to mRNA endogenous to said cells. 

23. A method for producing novel polypeptides in seed, said method 
comprising: 

growing a plant capable of developing seed, wherein said plant comprises 
10 cells containing integrated into their genome an expression cassette comprising in the 

5'-3' direction of transcription, a transcriptional initiation region obtainable from an 
oil body protein gene, a DNA sequence of interest encoding a polypeptide foreign to 
said plant, and a transcriptional termination region, under conditions whereby seed 
is produced in which said DNA sequence is expressed under transcriptional control 
15 of said transcriptional initiation region. 

24. A plant part, comprising a DNA construct according to Claim 8. 

25. A plant part according to Claim 24, wherein said part is a leaf, stem, 
root, flower, fruit or seed. 

26. An isolated DNA comprising: 

20 a transcriptional regulatory region which provides for expression of a DNA 

sequence of interest during a phase of embryogenesis which precedes accumulation 
of storage proteins, wherein said regulatory region is obtainable from oE body 
protein. 

27. The isolated DNA according to Claim 26, wherein the nucleotide 
25 sequence of said DNA comprises: 

a nucleotide sequence from nucleotide -867 to nucleotide 1 of the genomic 
clone shown in Figure 2A. 

28. A method for expressing a DNA sequence of interest in a host plant 
during a phase of embryogenesis which precedes accumulation of storage proteins, 

30 said method comprising: 
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transforming said host plant with a construct comprising a DNA sequence of 
interest operably fused to a transcriptional regulatory region, wherein said regulatory 
region is obtainable from an oil body protein gene; and 

growing said plant under conditions whereby seed is produced and said DNA 
5 sequence of interest in expressed under transcriptional control of said regulatory 

region. 

29. The method according to Claim 28 wherein said host is a 
dicotyledenous oil seed cell. 

30. The method according to Claim 29, wherein said gene is obtainable 
10 from the group consisting of 

(a) Brassica napus; 

(b) Zea mays; 

(c) carrot; and 

(d) Arabidopsis. 

15 3L A method for obtaining a purified polypeptide of interest, said method 

comprising: 

transforming a host plant cell with a DNA construct under genomic integration 
conditions, wherein said DNA construct comprises a first DNA sequence encoding 
a polypeptide of interest inserted in reading frame into an oil body protein (OBP) 
20 genewhichincludesasufficientportionof the regulatory region5' tothetranslational 

start site of said OBP gene to provide for expression of said gene in seed, wherein 
said sequence is inserted at a site in said gene so that expression of said DNA 
sequence is controlled by said regulatory region, whereby said DNA construct 
becomes integrated into the genome of said plant cell; 
25 growing said plant to produce seed whereby said polypeptide of interest is 

expressed as a fusion protein with the expression product of said OBP gene; 

isolating oil bodies from the cells of said seed; 

disrupting said oil bodies whereby said fusion protein is released; and 

purifying said polypeptide of interest. 

30 32 - A method for obtaining a polypeptide of interest in an oil body, said 

method comprising: 
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expressing said polypeptide in seed as a fusion 
sufficient portion thereof to provide targetting to said oil body. 
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